Fix float 16 for cuda collectives #414
Conversation
case GA_DOUBLE: return ncclDouble;
case GA_LONG: return ncclInt64;
case GA_ULONG: return ncclUint64;
#ifdef CUDA_HAS_HALF
I'm not sure what to do if CUDA_HAS_HALF isn't defined. I think it should always be defined, as we only support recent enough CUDA versions, so I would just remove the ifdef. Do you agree with that?
Otherwise, it looks good. Thanks.
This is defined in nccl.h and is enabled if there is compatibility. Should I remove it?
I remember that, but I think GA_HALF is always defined. So this causes a problem if libgpuarray supports it while NCCL doesn't, and we don't give a good error message. We require CUDA 7 or more recent:
http://deeplearning.net/software/libgpuarray/installation.html#run-requirements
And CUDA_HAS_HALF is always true in that case. So I would remove the ifdef.
Yes, don't use an ifdef switch, since we load the libraries dynamically and we don't want to disable it for future loads.
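For illustration, here is a minimal sketch of what the mapping could look like with the ifdef removed. This is not the actual libgpuarray code: the function name convert_data_type is hypothetical, and the sketch assumes the GA_* typecodes from gpuarray/types.h, NCCL 1.x's ncclDataType_t from nccl.h (including its nccl_NUM_TYPES sentinel), and a CUDA version recent enough that nccl.h defines ncclHalf.

```c
/* A minimal sketch, not the actual libgpuarray code: map a gpuarray
 * typecode to the NCCL dtype used by the collectives.  GA_HALF is
 * handled unconditionally, assuming a CUDA version recent enough that
 * nccl.h defines ncclHalf.  Unsupported typecodes fall through to a
 * sentinel so the caller can raise a clear runtime error instead of
 * the case being silently compiled out. */
#include <gpuarray/types.h> /* GA_* typecodes */
#include <nccl.h>           /* ncclDataType_t */

static ncclDataType_t convert_data_type(int typecode) {
  switch (typecode) {
  case GA_BYTE:   return ncclChar;
  case GA_INT:    return ncclInt;
  case GA_HALF:   return ncclHalf;  /* half-precision scalar type */
  case GA_FLOAT:  return ncclFloat;
  case GA_DOUBLE: return ncclDouble;
  case GA_LONG:   return ncclInt64;
  case GA_ULONG:  return ncclUint64;
  default:        return nccl_NUM_TYPES; /* unsupported: report at runtime */
  }
}
```

Returning a sentinel for unsupported typecodes keeps half support a runtime property of the loaded libraries rather than a compile-time one, in line with the point above about dynamic loading.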
I pushed the small change to this PR. If Travis passes, I'll merge. Thanks.
case GA_LONG: return ncclInt64;
case GA_ULONG: return ncclUint64;
case GA_HALF: return ncclHalf;
case GA_FLOAT16: return ncclHalf;
GA_FLOAT16 means a vector of 16 floats, not the half-precision scalar type (that is GA_HALF). This is horribly inaccurate.
I had asked on #411 about the types. Sorry for the inconvenience.